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ABSTRACT 
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SUMMARY OF BINARY ADDERS 
by 
F. B. Hartman 



The circuit technology used in the construction of a data processing system 
forms an important factor in determining its operating characteristics . 
The set of basic building blocks provided can be arranged in many forms to 
realize the functional requirements of the system. In designing for high 
speed operation, it is the duty of the circuit designer to provide fast circuits. 
This will guarantee a generally fast system. On the other hand, it is the duty 
of the logic designer to specify from among numerous alternatives those ar- 
rangements of the circuits that will facilitate the realization of high system 
speed consistent with reasonable cost. 

System considerations often dictate the use of binary arithmetic, particularly 
when the system has to solve scientific problems. Speed and accuracy con- 
siderations dictate that the binary digits be handled in parallel as much as 
possible and with a sufficiently long word length. Since all arithmetic opera- 
tions reduce ultimately to addition, the design of a fast parallel binary adder 
has a particularly important bearing on achieving high speed in the system 
containing it. 

Analyses of binary adders have appeared extensively in the literature. 
Richard's book* contains a general survey. Campbell and Rosser discuss 
the carry transmission problem and means for speeding up adders by various 
strategies. Gilchrist, Pomerene and Wong 1 ' discuss details of an asynchron- 
ous adder. Mercer* discusses a form of "carry save" adder in conjunction 
with a micro-program computer structure. 

The purpose of this section consists in discussing the various forms of binary 
adders from a general viewpoint, including a summary of the data from the 
above sources. 

Forms of Binary Full Adders - As a base, the design of any binary adder 
begins with a consideration of the single -position binary full adder. The 
actual form used will depend upon the circuit technology involved, but in 
Figure 1, we have shown several representative forms. 

The full adder of Figure la uses the exclusive OR block and can be considered 
as two half adders in cascade. A single half adder (shown within the upper 
rectangle) generates the sum and carry of the operand bits, A and B. The 
sum line (P) from this circuit is added to the "carry in" signal C. in the other 
half adder. A carry out of either half adder or both will generate a "carry out" 
signal (C ) for the full addition in the inclusive OR circuit. Attention is directed 
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Figure 1 FORMS OF BINARY FULL ADDERS 

1 











V 


PI 






A - 








V 






' 1 " 


B " 




























I 








A 


G 


1 

1 
1 






i_ 
















1 » 










I 




A 


*■ 











1 " 




1 


r . _ 


I 


c i 


w i ■■ 

l_ 


i 












J 



(a) Using two Half Adders in Cascade 
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to the circuitry in the lower dotted rectangle which shows the method of 
generating the "carry out" signal. A "carry out" of unity can occur in 
two distinguishable ways. In the first case, if both operand bits A and 
B are unity, the "generate" line (G) is unity, which forces a "carry out" 
through the OR circuit. In this case, the full adder is said to "generate" 
a carry. In the second case, if A and 3 have opposite values, the lower 
AND circuit will be conditioned by the "propagate" line (P) to transfer 
the value of the "carry in" line directly to the output, since at this time 
the "generate" line (G) is down and the OR circuit looks like a direct 
connection to the other input. In this case the adder is said to "propagate" 
the "carry in. " On the other hand, if both A and B have a zero value, the 
stage neither generates nor propagates a carry. 

Analysis will show that, insofar as the generation of the "carry out" is 
concerned, the propagate signal can be taken as the inclusive OR rather 
than the exclusive OR. This arrangement is shown in Figure lb, which 
shows also the generation of the sum in terms of the "carry out." This 
form has been used in commercial equipment and has the advantage that 
complementary input signals are not needed and but one inverter is used. 

Figure lc shows a special form of full adder in which the complements 
of the "propagate" and "generate" signals are generated internally, and 
used in conjunction with complementary "carry in" signals to generate the 
sum and complementary "carry out" signals. (If complementary "carry 
in" signals are used, complementary "carry out" signals will be needed 
in some applications). 

Many other forms can be imagined, but unless complementary inputs for 
each of the three bits is provided, some form of inversion action will be 
required internally, and this implies some form of amplifier. 

Basic Forms of Binary Adders - If the binary words consist of single 
digits, the problem reduces to the design of the binary full adder dis- 
cussed above. For longer word lengths, several distinct types of adder 
have appeared. Figure Ea shows a serial binary adder which can add 
two binary words of any length one position at a time. Its distinguishing 
feature consists of the carry flip-flop which serves to store the "carry 
out" from one cycle of the addition in order to present it as the "carry 
in" of the next cycle. The operand bits must be presented serially, one 
lined-up pair per cycle, starting with the lowest-weight position. The 
sum bits are carried off in similar fashion to some storage device. 
Letting n represent the length of the operand in bits, t, the delay of the 
binary full adder in generating the sum or"carry out," whichever appears 
later, and t-j the resolution time of the flip-flop (time required for enter- 
ing the carry outV, then the shortest cycle possible equals t, + to, and for 
the time T to add any pair of words we have 

T = n(t x + t 3 ) (Serial Adder) (1) 
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Figure 2 



TWO BASIC FORMS OF ADDERS 



A 
B 















C| 








Carry 
Flip — 

Flop 
























FA 

(Fig 1a 

or 1b) 


Co T 
















b 




w 






(a) Serial Adder 





'01 




(b) Linear- Propagate Adder 



Page 5 



Figure 2b shows a parallel linear -propagate adder for adding four-bit words. 
Its characteristic feature consists in the connection of the "carry out" of one 
binary full adder to the "carry in" terminal of the next one in order. The 
operand bits are presented simultaneously to each position and the sum bits 
removed simultaneously. Because of the random nature of the bits of the 
operands, the actual delay will be random and will depend upon the maxi- 
mum length that carries have to propagate. 

A carry sequence of length j will be said to be present, if a carry is gen- 
erated in some stage (T(by convention, cr s 1 represents the highest -weight 
position and (T- n the lowest -weight position), the next j-1 positions to the 
left are set to propagate, and position <T -j is not set to propagate. (Here 
we take "propagate" to mean the exclusive OR of the operand inputs rather 
than the inclusive OR). For each stage that generates a carry, there will 
be a corresponding carry sequence of length greater than zero. In linear- 
propagate adders, enough time has to be allowed for the maximum possible 
length of carry sequence, which is seen to involve the generation of a carry 
in the lowest -order position, with all other positions set to propagate. 
Thus the worst case involves a carry sequence of length n, the length of the 
operands. Letting tj represent the time for the "propagate" and "generate" 
signals to settle in all orders plus the time needed to generate the sum 
(given the carry in); letting t;? represent the time to propagate a carry 
through a stage (given the "generate" and "propagate" signals), and letting 
t3 represent the resolution time of the flip-flops which constitute the sum 
register, the complete period needed to form the sum and place it in the sum 
register under worst-case conditions is given by 

T = tj + ntp + to (Linear-Propagate Adder) (2) 

The forms of the serial adder and the parallel linear-propagate adder are 
such that additions proceed at a rate of 1/T, where T is given by equations 
(1) and (2) respectively. 

Two forms of variable-speed adders, called asynchronous adders, are 
shown in Figure 3. In both cases, the adder must be in a reference condi- 
tion prior to the beginning of the addition, such that the lines C and Cq 
in each position are both zero. In the adder of Figure 3a, this is performed 
by holding down the Start Add line for a time equal roughly to the time needed 
to propagate a carry through one stage, say t^. With the adder in its refer- 
ence condition, the addition is started by setting Start Add to unity. After a 
certain time, depending upon the operand magnitudes, one or the other (but 
not both) of the lines C and C Q will be at unity for each position of the adder. 
The inclusive OR of these lines is formed for each position and connected to 
an n-way AND circuit, the output of which will be unity when the add is com- 
plete. This line can be used to cause the transfer of the sum to the sum reg- 
ister and after this is done, the adder can be reset to its reference condition 
for the next add operation. Internal race conditions leading to transient 
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Figure 3 
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hazards on the carry lines may exist at the start of the add operation, 
but so long as a minimum time delay is enforced for the operation (as 
for example by means of the delay line shown) these should not cause 
error. After this initial delay, the appearance of a one on either the 
C line or the C ' line will indicate that the "carry in" for that position 
has been received, or else internally generated or inhibited (both operand 
bits at zero), By the time the End of Add signal comes up, the sum lines 
should be settled to their final value. Thus, letting tj represent the mini- 
mum time allowed for the add operation (no carries propagated), assuming 
the time needed to reset the adder equals tj, and letting tj represent the 
resolution time of the flip-flops of the sum register, we find that the ex- 
ecution time T of this adder is bounded as follows: 

2t. + t 3 = T— 2t + nt 2 + t 3 , (Asynchronous Adder) (3) 

where n is the length of the operands. 

The actual value of T will depend upon the distribution of bits in the 
operands, hence the term "self-timed" is used to refer to asynchronous 
adders. The operands are random and some assumption as to the prob- 
ability distributions has to be made in order to calculate the probability 
distribution of the maximum length carry sequence. For this type of 
adder, one must consider the propagation of zero carries in addition to 
the propagation of one carries in determining the maximum length carry 
sequence. Assuming a uniform distribution of operand bits ( a one or zero 
in each position is equally probable), Gilchrist, Pomerene and Wong 3 show 
that the mean of the maximum- length carry sequence in adding 40-bit words 
equals 5. 6. 

In the Appendix of this section, we show that the expected maximum length 
carry sequence has an upper bound of Qog2nH + 2, where the brackets 
mean to take the integral portion of the expression enclosed, and n is the 
number of bits in the operands. Using this expression, the average add 
time for the asynchronous adder is bounded as follows: 

Tavg == 2t x + / log 2 n + 2\t 2 + t 3 . (4) 

(Asynchronous Adder) 

Figure 3b shows an adder similar to the one in Figure 3a except for the 
use of a pair of binary full adders in each position and the use of exclusive 
OR circuits. The exclusive OR circuits are used not only for the sensing 
of carry completion, but also for sensing of sum completion and error de- 
tection. If any sum or "carry out" is in error, the "End of Add" signal 
cannot come up. The same relations govern the time of executions as hold 
in the case already considered. 
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Another form of variable delay adder is shown in Figure 4. The dis- 
tinguishing features of this adder are (1) each stage involves only a half 
adder, and (2) the "carry out" from each order is stored (or "saved") in 
a flip-flop and introduced as a "carry in" to the next order in the next 
cycle. To start, the two operands are added and the sum and carry words 
both stored in registers. In the next cycle, the numbers in the sum and 
carry registers are added and the revised sum and carry words formed. 
This is continued until the carry register contains all zeros. The opera- 
tion is essentially synchronous, but a variable number of cycles is needed 
in order to propagate the carries. Letting tj represent the time needed to 
gate in operands and to form the sum or carry, whichever appears later; 
letting t represent the resolution time of the flip-flops of the sum and 
carry registers, and allowing one cycle for the recognition of the "End of 
Add! 1 signal, the execution time is given by 

2(t x + t 3 )^ T— (n + 2) (t x + t 3 ) (Carry Save Adder) (5) 

The actual execution time will depend upon the distribution of one's and 
zero's in the operand words. In this case, only the propagation of one's 
needs to be considered. The Appendix shows that the average of the maxi- 
mum length of carry sequence is bounded by [logjjnQ+l. Hence, the aver- 
age execution time is bounded as follows: 

Tavg = log 2 n + 3 (tj + t 3 ). (Carry Save Adder) (6) 

Carry Transformations - The length of the random maximum-length carry 
sequence forms a principal determinant of the speed of the adders consid- 
ered above. Except for the serial and linear- propagate adders, each of 
these could take advantage of the relatively low average of the maximum- 
length carry sequence. In the linear-propagate adder, enough time has to 
be allowed for the maximum possible length of carry sequence. This situa- 
tion has led to numerous strategies for speeding up the carry propagation 
by transforming those portions of the adder which have the specific function 
of transmitting the carry to the next position. For the individual stage, 
these are shown in dotted rectangles in Figure 1 for the various forms of 
binary adder considered. 

Consider a linear-propagate adder with n stages, numbered from 1 to n 
from left to right (high order to low order). Letting C ., G. and P. repre- 
sent the "carry out", "generate" and "propagate" signals, respectively, 
of the j-th stage, letting C (n + 1) represent the carry into the adder 

(C n i , ,. _ r >; then the following set of equations represent the 
\_ o ^n + x) - HLn-/ 
carry propagation circuitry: 

C oj =G j + P J C o (j+ l)^ s1 ' 2,....,n) (7) 
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Figure 4 CARRY SAVE ADDER 
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In words, the "carry out" of the j-th stage will occur if a carry is 
"generated" within the j-th stage or if a "carry out" appears from the 
(j + l)-th stage and the j-th stage is set to "propagate". Considering 
only the first four stages, the first four of the above set of equations 
can be combined to form 

C 1 = G 1 + PlG 2 + PlP 2 G 3 + P l P 2 P 3 G 4 +P l P 2 P 3 P 4 C o5- < 8 > 

Circuitry to realize this equation in two levels of logic will be called a 
Simultaneous Look Ahead Carry over stages 1 through 4, abbreviated 
SLAC 1-4. It is used in many of the adder forms to be discussed 
presently. 

Another form of carry transformation consists in keeping the carry 
propagation circuitry shown in Figure 1 for the individual stages, but 
adding circuitry to speed up the propagation over a group of stages. 
Considering again stages 1-4, the equation corresponding to the added 
circuitry has the form 

C , a C* + P. P P.P. C , (9) 

ol ol 1 2 3 4 o5 

where C*, represents the "carry out" generated by the stage itself. 
In applying this transformation, the actual carry out for the group is 
taken from the added circuit. This form of circuit will be called an 
ordinary carry look ahead over stages 1 through 4, ' abbreviated as LA.C 
1-4. It is assumed to be formed in two levels of logic. 

The forms of circuitry represented by equations (8) and (9) can be formed 
for any group of adjacent stages within the adder. Upon choosing the 
groups and the tupe of lookahead to be used for each group, one establishes 
a carry transformation, by which a carry sequence of length j originating 
at some position of the adder is executed in a time less than or equal to 
that required for the same carry sequence in the linear-propagate adder. 
In order to correlate the present discussion with the analysis of Campbell 
and Rosser' 2 , we shall assume that the delay through a lookahead circuit 
is equal to the time required to propagate a carry through one stage of the 
linear-propagate adder, regardless of the number of stages involved in the 
lookahead. In some technologies, the actual delays will be somewhat 
longer because the lookahead circuit has more inputs, so that the assump- 
tion may not strictly hold. With this assumption, the delay through a 
lookahead circuit equals the previously- defined t2« 

A simple form of carry transformation, resulting in what we shall call 
the "linear -grouped" adder is shown in Figure 5. In this transformation, 
the entire adder is divided into groups, at least one of which contains more 
than one stage. We have shown a four-bit adder divided into two groups of 
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two stages each. Carries are. propagated linearly within a group, but 
are transferred by carry lookahead around the group, should the need 
arise. An analysis of the time delay involved is facilitated if the adder 
retains an iterative structure, as for example by specifying equal-sized 
groups. Because of the assumption on the delay of the lookahead cir- 
cuitry, a carry will propagate across a group in the time it takes to 
propagate over one stage within a group. Letting j represent the number 
of units of time for propagating a carry sequence of length j in the linear- 
propagate adder, letting R represent the number of units of time for 
propagating the same carry sequence through the linear-grouped adder, 
letting m denote the number of stages in one group, and letting (T repre- 
sent the position within a group at which the carry originates {Q^- 1, 2, 

, m; counting from left to right in a group), we see that R depends 

upon the length of the carry sequence involved and the position within a 
group at which the carry sequence originates. Thus, we write 

R = f(j.<T). (10) 

The function f can be determined by inspection of the circuitry in each 
case. In doing this for the example of Figure 5, assume that the "carry 
out" of the adder is connected to the "carry in" of the adder, which is the 
actual situation which arises in subtraction. With this symmetry, it is 
easy to verify that the function f for this case is as given in Table I, (p. 16). 
The maximum number of units of time required is seen to be 3. This is an 
Improvement over the linear -propagate adder which, requires 4 units in the 
worst case. 

Campbell and Rosser** contains an analysis of the general case which shows 
that with m^> 2, the number of units of time needed in the worst case is given 
by 

R max = n / m + 2m " 3 > ( m ^> 2 ) » (11) 

which arises when j = n- 1 and CT - m (n/m equals the number of groups in 
the n-bit adder). Thus the linear-grouped adder has a worst-case execution 
time of 

R = tj + (n/m + 2m - 3) t 2 + t 3 (Linear-Grouped Adder) 

(12) 

Simultaneous Adder - A different sort of carry transformation is used in 
the adder as shown in Figure 6. Once the "generate" and "propagate" sig- 
nals have settled for all stages, the "carry in" to each stage and the "carry 
out" of the adder as a whole is formed in the time required for the linear- 
propagate adder to transmit the carry over one stage. The full adders 
(denoted by FA) do not need the internal carry generating circuit, and this 

is indicated by the asterisk. Thus, the simultaneous adder has an execution 

time given by 

T = tj + t 2 + t 3 ; (Simultaneous Adder) (13) 
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Figure 6 
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A design strategy related to the use of a simultaneous adder consists in 
dividing an n-bit adder into groups of simultaneous adders (m stages in 
each group) linearly connected. This scheme is illustrated in Figure 7. 
An analysis in Campbell and Rosser 2 for equal size sub-adders shows 
that N, the number of units of time needed to propagate a carry sequence 
depends upon j and (T~ . For m^.j ^ n, N is given by 



N-3-j^.l + [«-/"- 1) /™] ■ 



(14) 



where ,W\ 1 represents the Kronecker delta { £ p. j = 1 if ^~~ *» otner - 
wise zero), and the brackets mean to take the integer part only. Taking 
again the case of m > 2, we see that N is given by (n/m) + 2. Thus, 

for execution time, we have for the worst case 

T =t + ( (n/m) + 2) t- + t 3 (Linear Simultaneous Adder) (15) 

More General Forms of Carry Transformation - By considering each of 
the above forms of adder as simply a single group in a larger adder, we 
arrive at more general carry transformations. Thus, in Figure 8 we show 
a so-called "grouped-grouped" adder; in Figure 9, a "grouped- simultaneous" 
adder; in Figure 10, a "simultaneous-grouped" adder; and in Figure 11, a 
"simultaneous-simultaneous" adder. In each case, the first part of the 
name designates the manner of carry propagation involved in connecting 
the main groups. The second part designates the type of adder which forms 
one group. 

A means for saving equipment in the Simultaneous-Grouped Adder is shown 
in Figure 10. Rather than have the "generate" and "propagate" signals feed 
from each stage to the SLAC circuits, a "generate" and a "propagate" signal 
is first formed for each of the linear-grouped adders as a whole, in the cir- 
cuits labelled "G and P". Considering stages one through four, we have the 
equations 

G l-4 = G l + P i G 2 + P l P 2 G 3 + P 1 P 2 P 3 G 4' (l6) 

P l-4 = P 1 P 2 P 3 P 4* (1?) 

which show the structure of the circuit. This strategy keeps the number of 
inputs to the largest SLAC circuit low at the expense of some additional time 
delay in the "G and P" circuits. 

The number of levels of grouping can be extended to any desirable number of 
levels. A 100-bit adder involving three levels of grouping is shown in Figure 
12. This could be called a "Simultaneous-Simultaneous-Simultaneous" Adder, 
but at these levels of complexity, the names become cumbersome. This form 
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Figure 8 
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Figure 9 
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Figure 11 SIMULTANEOUS SIMULTANEOUS ADDER 
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of adder is used in the Sigma Computing System. 

Evidently many possible carry transformations exist. In any specific case, 
the method of analysis set forther in Gampbell and Rosser^ should enable 
the logic designer to derive the carry transformation involved, and thus the 
worst case delay. 

Other Possibilities - Each of the carry transformations considered above 
can be considered as variations on the linear- propagate adder. As such, 
the rate of operation of any of these adders can be no higher than that 
determined by the worst case carry situation. This contrasts sharply with 
the variable -speed adders such as the asynchronous and carry save adder 
schemes discussed previously, inthat those of the latter group have an 
average speed which depends upon the average carry situation. We wish to 
point out, however, that the various basic methods of parallel addition can 
be combined in numerous ways. One could, for example, form a "carry- 
save grouped" adder in which the carries are propagated by some form of 
carry propagation within a group, but the "carries out*of a group are saver" 
in a carry register for presentation as "carries in* during the next cycle. 
One could on the other hand have a "grouped- carry- save" adder in which 
each individual group forms a carry- save adder, but in which a look ahead 
carry circuit passes the carry around the group should the need arise. In 
this case, once a carry has been propagated across a group, means must 
be provided to prevent the "carry out* of the highest order stage from develop- 
ing a group carry later on when the carry being propagated reaches it through 
the carry save feature. 

As another possibility, any of several of the carry transformations discussed 
above can be combined with either form of asynchronous adder, by providing 
similar carry transformations on the zero carry lines. One could have a 
"linear- grouped asynchronous" adder, "linear- simultaneous asynchronous" 
adder, etc. Note, however, that it would be fruitless to specify a "simultane- 
ous -asychronous" adder, since by equation (13) the speed of the adder will be 
as given if any carry is generated, and will be less by the amount t2 only if 
no carries are generated. Thus, the delay introduced by the asynchronous 
connections would probably exceed t2» which is the only time interval that 
could possibly be reduced. 

As a final example of other possibilities, consider the following design 
strategy. First, determine from probability considerations that maximum 
length of carry sequence which will be exceeded in a small fraction of the 
additions, say 10%. Design a linear- simultaneous adder with this size 
grouping, and let its execution time equal tj at most for the majority (90%) 
of the cases. Provide additional carry transformations (say by transforming 
to a synchronous- synchronous adder) such that the worst of all possible carry 
sequencesgives an execution time not exceeding 2tj. Finally, provide circuits 
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to recognize whether or not the carry sequence exceeds the size of the 
basic group, and design it fast enough to operate in a time tj. Then the 
resulting adder can be made variable speed. In the majority of cases 
(90%) the execution time will equal t . In the other 10% of the cases, 
the execution time will equal 2tj. Whether to take one or two cycles 
will be decided by the recognition circuitry, which will operate fast 
enough. The average time will equal .9 (tj) + . 1 (2tj) or 1. ltj. The 
main feature of this arrangement consists of the recognition circuitry 
which is apt to involve considerable equipment, but with it one may 
almost double the speed attainable. 

Summary - We have shown several basic forms of binary adders, 
several variations of the linear -propagate adder obtained by carry 
transformations, and mentioned a number of mixed forms. The logic 
designer has here a catalog of many possible forms. In deciding which 
to use in a particular application, his choice may be determined in part 
by considerations of the circuit technology involved. For example, in 
some technologies the delay through an AND-OR stage of logic may in- 
crease with the number of inputs, so that the LAC, SLAC and "P and G" 
circuits may not have the speeds assumed in this paper. Under such 
conditions, the increase in speed obtainable by carry transformations 
may not be as large as the theoretical figures show. Detailed considera- 
tions of this kind as applied to adders are beyond the scope of the present 
section. However, the reader should remember that, important circuit 
factors enter into the consideration of each case, and so there is more to 
designing an adder than just picking one of the arrangements illustrated 
here. 
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APPENDIX 
Upper Bounds to the Average of the Longest Carry Sequence 



Starting from simple assumptions upon the probability distributions of 
the two n-bit binary numbers that are to be added, we derive a recur- 
sive expression for the probability distribution function of the longest 
carry sequence. Neglecting certain terms in this expression that tend 
to reduce the values of the probabilities and manipulating the expression, 
we convert it to an inequality. Using this inequality in the formula for 
the "mean of the longest carry sequence, " we derive an upper bound. 
Two cases are considered. In the first case, which applies to asynchron- 
ous adders, the propagation of "ones" and "zeros" is considered, and a 
value of [log n + 2 | is derived as an upper bound to the average of the 
longest carry sequence. In the second case, which applies to carry save 
adders, the propagation of "ones" only is considered, and an upper bound 
of jlog^n + l) is derived for the average. (Note that the brackets 
indicate to take the integer portion only of these expressions). 

We assume that each bit in each word has a value independent of the 
values of all other bits and for each bit, a "one" or a "zero" value is 
equally likely and occurs with a probability of 1/2. 

Considering a pair of operand bits in any position, we find four possible 
pairs of values: 00, 01, 10, and 11, each occurring with a probability of 
1/4. 

We distinguish two types of carry propagation: (1) Both "ones" and "zeros" 
are propagated and (2) only "ones" are propagated. In the first case, which 
applies to asynchronous adders, a "one" carry will be "generated" if the 
combination 11 occurs, and a "zero" carry will be "generated" if the com- 
bination 00 occurs. Thus the probability of "generating" a carry of either 
kind equals 1/2. In the second case, which applies to "carry save" adders, 
a "carry" will be generated if and only if the combination 11 appears, and 
this occurs with a probability of 1/4. In either case, if the two operand 
bits have opposite values (combinations 10 or 01), then any carry will be 
"propagated" through that stage and the probability of "propagation", equals 
1/2. 

Let C. and C represent the event that a type (1) or type (2) carry 
respectively, is generated in some stage of the addition, is propagated 
through the next (j-1) positions, and enters the next stage thereafter (which 
may or may not be set to propagate). We shall call such an event a carry 
sequence of length j (of types (1) and (2), respectively). 

The probability of each of these events consists of the product of the sub- 
events which by conjunction make up the event, since the bit values for 
the individual stages are independent. 
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Hence we have 

Prob (C.) = Prob (E, and F, and •• -andF. ,)=Prob(E). 
lj 1 i j-1 J- 

Prob (Fj) Prob (F^j) (1) 

where E, represents the event of generating a type 1 carry in some position, 
and where F,, F, • • • , F. represent the events of propagating a carry 
through the first, second • • •, and (j-l)-th positions, respectively, to the 
left of the one in which the carry is generated. Substituting the known values 
for the probabilities of these events, we find 

ProbtCjj) = 1/2 j (2) 

In a similar manner we find 

Prob (C 2j ) = 1/2 j + X (3) 

These probabilities enter into the formulas for expressing the probability 
distributions of the longest carry sequence for the two types of propagation. 

Let P (v) represent (for either type 1 or type 2 propagation) the probability 
that the longest carry sequence (1. c. s.) which occurs during the addition of 
two n-bit numbers, will equal or exceed the given amount, v. This event 
will be represented by (1. c. s.= v) or by E n v . We can show that this event 
can occur in two mutually exclusive ways. in. the first case, the lowest 
order (n-1) positions will already contain a longest carry sequence that 
equals or exceeds v. This event is symbolized by (1. c. s. / n _ j» zrz v) or 
by E , .In the alternative case, the highest-order position is set to 
propagate, the adjacent v- 1 positions to the right contain a carry sequence 
of exactly (v-1), and the lowest-order (n-v) positions do not contain a long- 
est carry sequence which exceeds or equals v. The alternative case is thus 
the conjunction of three events, symbolized by F,, C/ y _j\, and E'. . 
where Fj is the event "first stage propagates , " C» ,» the event ' 

"carry sequence of length (v-1) occurs," and the prime indicates the absence 

of the event E 

n-v, V 

With this analysis we see that 

Prob (E n> y ) = Prob [(E^ y ) or Qrj and (C^) and Vv) ,}] (4) 

where the "or" is to be taken in the exclusive sense. From probability 
theory, we know that the probability of an exclusive "sum" of events is 
equal to the sum of the probabilities of the events. For a conjunction of 
independent events, the probability equals the products of the probabilities 
of the individual events. Finally, the probability of the negation of the event 



Page 23 



is equal to (the probability of that event) subtracted from (unity). Applying 
these values to equation (4), we obtain 

Prob (E n§v ) = Prob (E n _ ljV ) + Prob (Fj) • Prob (Ey.j) Tl-Prob (E n _ v#v )J 

(5) 

For type 1 carry propagation, C v _ \ reduces to Cj / v _ n whose .probability 
is given by equation (2) with j a v-1. Since F± has a probability of 1/2 and 
the other symbols have been defined, for this type of propagation we find 



J n' V '= P „.l.v + © - (r*) (l-Pn-vw) 



(6) 



For type 2 carry propagation C , reduces to C2 v _i whose probability is 
given by equation (3) with j = v-1. Thus we obtain for this type of propaga- 
tion, 

P n < v > * P n-l< v > + (i) (f-X 1 " P n-v < v >) < 7 > 

These are recursive relations for the probability distributions sought. 

Considering now type 1 propagation we can transform equation (6) into an 
inequality by writing 

P nM- P „-lM=iv (8) 



Suppose we form 

£3 

i a V 



I^P. (v) - P..! (v) _ 



(9) 



Since all probabilities except P n (v) and P . (v) will cancel out of the 
sum an4 since P v _i (v) equals zero (a carry sequence greater than the 
length of the adder cannot occur), we see that the entire sum equals P n (v). 
By changing the index n of equation (8) to index i and performing the sum 
over i from v to n we obtain expression (9) on the left (equal to P n (v)) and 
on the right a total of (n-v + 1) terms each equal to 1/2 V . Hence 

JL n - v + 1 
P n (v)^— ^ 

1 (10) 

Recognizing that P n (v) must be a probability and is bounded by 0<i P (v) ^_ 1, 
we can write 



<£- , , n-v_+ 1 

2 % 



P n (v) = min 1, ^ (11) 
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where the right member indicates to take either 1 or (n-v + l)/2 , 
whichever is less. This inequality will be used in determining the 
bound on the mean of the longest carry sequence. 

If P n (v) is the probability that the longest carry sequence equals or 
exceeds v, then the probability that the longest carry sequence is 
exactly equal to v is the probability that the longest carry sequence 
equals br exceeds v but does not equal or exceed v + 1. Thus we write 



Prob (1. c. s„ n a y)-= P n (v) - P n (v + 1) 



(12) 



The mean or average of a discrete random variable X that can take any 
values from 1 through n, is defined as 



X a 2_j i Prob (X * i) 

avg i a i x 



(13) 



Using this definition and letting M, represent the mean of the longest 
carry sequence we find from equation (12) that 



M, aZ i v 

1 i a 1 



Prob 



n 
(1. c. S. a v) a I 



v a 1 



[P n (v)-P n (v+l] 



(14) 



By writing a few terms of the sum as follows: 
1 • P n (l) . 1 • P n (2) 

~2* P (2) - 2. P n (3)" 

n " 

3 • P n (3) - 3 • P n (4)" 



+ 

+ 



(15) 



_n . P n ( n ) - n P n (n + li 



the reader will note that terms partially cancel. Using this fact and the 
fact that P (n + 1) a 0, we can write 



n 



M, 



'l"C, P n' V > 

V = 1 

Now substituting inequality (11) into equation (16) we find 



(16) 



M, 



C 

V a 1 



Min 



0- t^) 



(17) 
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The rest of the derivation consists in finding an upper bound for this series 
which can be taken as an upper bound for the longest carry sequence. 

In forming this sum we see that the minimum of the pair of magnitudes will 
equal unity for values up to some value of v, say v, . For vcV,, the mini- 
mum will be less than unity. . To find Vj, find bounds on v such that 

n- v+ 1 (18) 



2v ^ 1 

Thus 2 V i= n - v + 1 ( 19 ) 

This will be assured if 

2 V ^.n^2 (20) 

or if v ^ log 2 n SI (21) 

Letting log^n = a + e, where a is an integer and £= e" 1 ^. 1, we see that 
with v a flog-n + lj (brackets mean to take the integer part only), we have 

2 v 2 a + 1 ja l "» 

Restricting the discussion to n ^=2 and a^ 1, we see that the expression 
must be less than unity, although it may be almost unity for some large 
values of n. On the other hand, letting v ^frog? 1 *) s a » we see tna * 

n - v + 1 



v 



2 



Hence we may choose v, - 
Then we have 

M. ~r |log_n| + / n - v + 1 



a + 
2 


e - (a) + 1 . 2 e - (a - 1) 
2 a 2 a 


(23) 


n 


lis expression can exceed unity. 




>; 


1 = 


log 2 n + 1 


v = 


log, n + 1 



[, = [>og 2 n] 



(24) 



Consider the summation as proceeding to infinity. (This only enlarges 
the right-hand member). Since the first term is less than unity but may 
be very close to unity for some values of n; and since the ratio of successive 
terms is less than 1/2; we see that the entire sum is less than the infinite 
series 

* T -= T — 1 T • . . 
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which has a sum equal to 2. Thus we see 

M = ["log n~| +2 (25) 

Considering now type 2 carry propagation, similar arguments apply to 
derive the mean of the longest carry sequence, but wherever one had the 
factor 2 V , one replaces it with 2 V + 1 , this being the only operation needed 
to convert equation (6) into equation (7), which is the foundation of the 
analysis. Thus one can show that M,, defined as the mean of the longest 
carry sequence, is bounded as follows: 



n r n - v. + 1 ~| 

= H] min L 1 ' 2 V f l J 



M„ ±= 2_J min J 1, ,vtl I (26) 

v = 1 

For this case we seek bounds on v for which 

n - v ± 1 £ , (27) 

2 v f 1 - 1 

But this requires 

2 V + 1 = n- v+ 1 (28) 

This will be assured if 

2 V+ 1 ^ n^4 (29) 

or if v = (log 2 n) -1=1 (30) 



Thus with v a iog_n ss a + el 



a a. 



we have 



n - v + 1 s 2 a * e - (a) + 1 _ 2 6 " * - (a- 1 ) ^ x 
2 v+l 2 a + l - 2 a+l 



but with v =s a - 1 , 



Hence we can write 



M 2 = p°g 2 nj - 1 + T~! / n - v + 1 

L J v *[iog n] ( 2 v + 1 



(31) 



n - v -f 1 a 2 8 - (a - 1) + 1 a 2 e - ( a - 2) ^ 
2 v + 1 2 a ^ — 1 (32) 



(33) 
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Again comparing the summation with the infinite geometric series we 
find that it is no greater than 2. Thus, we can write 



M 2 = 



= [ l °g2 n 



+ 1 



(34) 
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